Making the Shortest-paths Approach to Sum-of-pairs Multiple Sequence Alignment More Space Efficient in Practice

Authors

  • Sandeep K. Gupta
  • John Kececioglu
Abstract

The MSA program, written and distributed in 1989, is one of the few existing programs that attempt to find optimal alignments of multiple protein or DNA sequences. It implements a branch-and-bound technique together with a variant of Dijkstra's shortest-paths algorithm to prune the basic dynamic programming graph. We have made substantial improvements in the time and space usage of MSA. These improvements make feasible a variety of problem instances that were previously infeasible. On some runs we achieve an order-of-magnitude reduction in space usage and a significant multiplicative-factor improvement in running time. To explain how the improvements work, it is necessary to give a much more detailed description of MSA than has been previously published. In practice, MSA rarely produces a provably optimal alignment; the reasons for the lack of optimality are discussed at length in the body of the paper.
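As a rough illustration of the idea named in the abstract, the sketch below treats sum-of-pairs alignment as a shortest-path search over the dynamic programming lattice and discards any path whose cost already exceeds a known upper bound. It is not the MSA program's code: the unit-cost scoring, the function names (`pair_cost`, `sp_column_cost`, `sp_align_cost`), and the simple "cost so far exceeds the bound" pruning rule are all assumptions chosen for brevity, and the bounds MSA actually uses are described in the paper itself.

```python
import heapq
from itertools import product


def pair_cost(a, b):
    """Assumed unit-cost scoring: identical symbols (including gap-gap) cost 0,
    a mismatch or a symbol against a gap costs 1."""
    return 0 if a == b else 1


def sp_column_cost(column):
    """Sum-of-pairs cost of one alignment column: sum over all pairs of rows."""
    k = len(column)
    return sum(pair_cost(column[i], column[j])
               for i in range(k) for j in range(i + 1, k))


def sp_align_cost(seqs, upper_bound=None):
    """Dijkstra over the k-dimensional alignment lattice.

    Nodes are tuples of prefix lengths; each edge consumes one symbol from a
    non-empty subset of the sequences and gaps the rest. Edges that would push
    the path cost above `upper_bound` (hypothetical parameter) are pruned.
    Returns the optimal cost, or None if every path exceeds the bound.
    Assumes the sequences themselves do not contain the gap symbol '-'.
    """
    k = len(seqs)
    start = (0,) * k
    goal = tuple(len(s) for s in seqs)
    moves = [m for m in product((0, 1), repeat=k) if any(m)]
    dist = {start: 0}
    heap = [(0, start)]
    while heap:
        d, node = heapq.heappop(heap)
        if node == goal:
            return d
        if d > dist.get(node, float("inf")):
            continue  # stale heap entry
        for m in moves:
            nxt = tuple(p + step for p, step in zip(node, m))
            if any(n > g for n, g in zip(nxt, goal)):
                continue  # would run past the end of some sequence
            column = [seqs[i][node[i]] if m[i] else "-" for i in range(k)]
            new_d = d + sp_column_cost(column)
            if upper_bound is not None and new_d > upper_bound:
                continue  # branch-and-bound style pruning
            if new_d < dist.get(nxt, float("inf")):
                dist[nxt] = new_d
                heapq.heappush(heap, (new_d, nxt))
    return None
```

Only lattice nodes that are actually reached are stored in `dist`, so a tighter upper bound directly reduces memory use in this sketch; the full lattice has a number of nodes equal to the product of the sequence lengths plus one, which is why pruning matters for more than a few sequences.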


Similar resources

Improving the Practical Space and Time Efficiency of the Shortest-Paths Approach to Sum-of-Pairs Multiple Sequence Alignment

The MSA program, written and distributed in 1989, is one of the few existing programs that attempts to find optimal alignments of multiple protein or DNA sequences. The MSA program implements a branch-and-bound technique together with a variant of Dijkstra's shortest paths algorithm to prune the basic dynamic programming graph. We have made substantial improvements in the time and space usage o...


Improving the Practical Space and Time Efficiency of the Shortest-paths Approach to Sum-of-pairs Multiple Sequence Alignment

The MSA program, written and distributed in 1989, is one of the few existing programs that attempts to find optimal alignments of multiple protein or DNA sequences. The MSA program implements a branch-and-bound technique together with a variant of Dijkstra's shortest paths algorithm to prune the basic dynamic programming graph. We have made substantial improvements in the time and space usage of ...


Optimal Alignment of Multiple Sequence Alignments

Chapter 1: Introduction. 1.1 Perspective. 1.2 Contributions. 1.3 Overview ...


Improving the Divide-and-Conquer Approach to Sum-of-Pairs Multiple Sequence

We consider the problem of multiple sequence alignment: given k sequences of length at most n and a certain scoring function, find an alignment that minimizes the corresponding "sum of pairs" distance score. We generalize the divide-and-conquer technique described in [1,2], and present new ideas on how to use efficient search strategies for saving computer memory and accelerating the procedure for ...


An Application of the ABS LX Algorithm to Multiple Sequence Alignment

We present an application of ABS algorithms for multiple sequence alignment (MSA). The Markov decision process (MDP) based model leads to a linear programming problem (LPP), whose solution is linked to a suggested alignment. The important features of our work include the facility of alignment of multiple sequences simultaneously and no limit for the length of the sequences. Our goal here is to ...


Publication date: 1995